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Motion  camouflage  in  a  stochastic  setting 


K.  S.  Galloway,  E.  W.  Justh,  and  P.  S.  Krishnaprasad 


Abstract —  Recent  work  has  formulated  2-  and  3-dimensional 
models  and  steering  control  laws  for  motion  camouflage,  a 
stealthy  pursuit  strategy  observed  in  nature.  Here  we  extend  the 
model  to  encompass  the  use  of  a  high-gain  pursuit  law  in  the 
presence  of  sensor  noise  as  well  as  in  the  case  when  the  evader’s 
steering  is  driven  by  a  stochastic  process,  demonstrating  (in  the 
planar  setting)  that  motion  camouflage  is  still  accessible  (in  the 
mean)  in  finite  time.  We  also  discuss  a  family  of  admissible 
stochastic  evader  controls,  laying  out  the  groundwork  for  a 
future  game-theoretic  study  of  optimal  evasion  strategies. 

I.  Introduction 

Motion  camouflage  is  a  stealthy  pursuit  strategy  which 
relies  on  minimizing  the  perceived  relative  motion  of  a 
pursuer  as  observed  by  its  prey.  The  phenomenon  of  motion 
camouflage  has  been  biologically  documented  for  visual 
insects  in  [16]  and  in  [11]  (based  on  [2]),  mathematically 
characterized  (e.g.,  [4]),  and  recently  analyzed  as  a  determin¬ 
istic  feedback  system  [9],  [13].  Furthermore,  a  geometrically 
indistinguishable  strategy  has  been  shown  to  be  used  by 
certain  echolocating  bats  intercepting  prey  insects  [3],  The 
analysis  of  a  feedback  law  for  motion  camouflage  has  been 
performed  in  both  the  planar  ([9])  and  three-dimensional 
settings  ([13]),  using  the  machinery  of  curves  and  moving 
frames  [1].  Close  parallels  have  been  shown  to  exist  between 
motion  camouflage  (which  is  rooted  in  biology),  and  the 
Pure  Proportional  Navigation  Guidance  (PPNG)  law  for 
missile  guidance  [12],  [15].  That  motion  camouflaged  pursuit 
appears  in  such  diverse  contexts  has  motivated  more  detailed 
study  of  such  aspects  as  sensory  feedback  delays  [14],  and 
the  stochastic  formulation  presented  in  the  present  work. 

Previous  work  in  [9]  and  [13]  made  use  of  natural  Frenet 
frames  [1]  to  describe  the  particle  trajectories  and  develop 
models  for  the  pursuer-evader  interaction  in  a  deterministic 
setting.  In  this  paper  we  make  use  of  the  planar  model 
to  investigate  the  impacts  of  stochasticity  such  as  sensor 
noise  and  evader  controls  driven  by  random  processes.  In 
the  biological  setting,  we  consider  possible  connections  with 
organisms  that  appear  to  use  stochastic  control  processes. 
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such  as  the  “run-and-tumble”  movement  exhibited  in  bac¬ 
terial  chemotaxis  (see,  e.g.,  [17]).  Many  species  of  bacteria 
use  this  type  of  stochastic  steering  control,  which  we  model 
as  a  continuous  time,  finite  state  (CTFS)  process  driven  by 
Poisson  counters.  (See  Section  V).  Other  types  of  erratic 
evasion  maneuvers  utilized  by  a  variety  of  insects,  birds 
and  fish  are  discussed  in  [5].  In  the  vehicular  setting,  we 
note  possible  applications  in  areas  such  as  aircraft-missile 
interactions,  considering  the  possibility  that  some  type  of 
stochastic  evasive  maneuver  may  in  fact  prove  most  effective 
against  an  inbound  weapon.  (We  do  not  consider  such  issues 
in  the  current  paper  but  plan  to  address  the  game-theoretic 
problem  in  future  work.) 

We  proceed  by  sketching  the  planar  pursuit-evasion  model 
as  well  as  some  of  the  fundamentals  of  motion  camouflage 
and  the  motion  camouflage  proportional  guidance  (MCPG) 
feedback  law  derived  in  [9].  In  Section  III  we  consider  the 
effects  of  sensor  noise,  and  in  Section  IV  we  address  motion 
camouflage  with  a  stochastically  steering  evader,  stating  and 
proving  a  proposition  that  the  MCPG  law  will  still  ensure 
attainment  of  motion  camouflage  in  the  mean  in  finite  time. 
After  presenting  some  specific  forms  of  admissible  stochastic 
controls  in  Section  V  and  simulation  results  in  Section  VI, 
we  conclude  by  discussing  directions  for  future  work. 

II.  Motion  camouflage  model 

Our  starting  point  is  the  deterministic  planar  motion 
camouflage  model  described  in  [9],  A  generalization  of 
this  deterministic  planar  model  may  be  found  in  [13],  and 
the  analysis  presented  here  can  be  generalized  to  three 
dimensions  using  the  same  techniques.  Because  here  we  wish 
to  focus  on  the  novel  stochastic  elements  being  introduced 
into  the  motion  camouflage  model,  we  restrict  discussion 
to  the  planar  setting.  Also,  to  streamline  the  discussion  we 
assume  constant-speed  motion  and  no  sensorimotor  feedback 
delays. 

A.  Trajectory  and  frame  evolution 

To  keep  the  discussion  as  self-contained  as  possible,  we 
reiterate  the  basic  planar  motion  camouflage  formulation  of 
[9].  Particles  moving  at  constant  speed  subject  to  continuous 
(deterministic)  steering  controls  trace  out  trajectories  which 
are  C2,  i.e.,  twice  continuously  differentiable.  Without  loss 
of  generality,  we  may  assume  that  the  pursuer  particle  moves 
at  unit  speed,  and  the  evader  particle  moves  at  speed  v  >  0 
(i.e.,  v  corresponds  to  the  ratio  of  speeds  of  the  pursuer  and 
evader). 
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Fig.  1.  Illustration  of  the  trajectories  and  natural  Frenet  frames  for  the 
planar  pursuer-evader  engagement.  (Figure  from  [9].) 

The  motion  of  the  pursuer  is  described  by 

fp  =  xp, 

=  Ypup, 

y  p  =  -Xpup,  (l) 

and  the  motion  of  the  evader  is  described  by 

re  =  isxe, 

±e  =  vy  eUe, 

ye  =  -vxeue,  (2) 

where  the  steering  control  of  the  evader,  ue,  is  prescribed, 
and  the  steering  control  of  the  pursuer,  up,  is  given  by  a 
feedback  law.  The  orthonormal  frame  {xp,  yp } ,  which  is  the 
planar  natural  Frenet  frame  for  the  pursuer  particle,  evolves 
with  time  as  the  pursuer  particle  moves  along  its  trajectory 
(described  by  rp).  Similarly,  {xe,ye}  is  the  planar  natural 
Frenet  frame  corresponding  to  the  evader  particle.  (In  the 
planar  setting,  the  natural  Frenet  frame  and  the  Frenet-Serret 
frame  coincide;  however  in  higher  dimensions  the  distinction 
is  critical  [1],  [8].) 

Here  we  assume  v  <  1,  so  that  the  pursuer  moves  faster 
than  the  evader.  Figure  1  illustrates  (1)  and  (2).  (As  noted  in 
[9],  the  controls  ue  and  up  are  actually  acceleration  inputs 
since  they  directly  drive  the  angular  velocity  of  the  particles. 
However,  the  speed  for  each  particle  remains  constant  since 
the  acceleration  inputs  are  constrained  to  be  applied  perpen¬ 
dicular  to  the  instantaneous  direction  of  particle  motion.) 

B.  Definition  of  motion  camouflage 

In  this  paper  we  focus  on  “motion  camouflage  with  respect 
to  infinity”,  the  strategy  in  which  the  pursuer  maneuvers  in 
such  a  way  that,  from  the  point  of  view  of  the  evader,  the 
pursuer  always  appears  at  the  same  bearing.  This  is  described 
in  1 9]  as 

rP  =  re  +  Xroo  (3) 

where  is  a  fixed  unit  vector  and  A  is  a  time-dependent 
scalar.  We  define  the  “baseline  vector”  as  the  vector  from 
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the  evader  to  the  pursuer 

r  =  rp  —  re,  (4) 

and  |r|  denotes  the  baseline  length.  Restricting  ourselves  to 
the  non-collision  case  (i.e.  |r|  ^  0),  we  define  w  as  the  vector 
component  of  r  which  is  transverse  to  r,  i.e. 


It  was  demonstrated  in  [9]  that  the  pursuit-evasion  system 
(1),(2)  is  in  a  state  of  motion  camouflage  without  collision 
on  a  given  time  interval  iff  w  =  0  on  that  interval. 

C.  Distance  from  motion  camouflage 
The  function 


describes  how  far  the  pursuer-evader  system  is  from  a  state 
of  motion  camouflage  [9],  [13].  The  system  is  in  a  state  of 
motion  camouflage  when  I  =  —1,  which  corresponds  to 
pure  shortening  of  the  baseline  vector.  (By  contrast,  T  =  0 
corresponds  to  pure  rotation  of  the  baseline  vector,  and  1  = 
+1  corresponds  to  pure  lengthening  of  the  baseline  vector.) 
The  difference  T  —  (—1)  >  0  is  a  measure  of  the  distance  of 
the  pursuer-evader  system  from  a  state  of  motion  camouflage. 

For  (6)  to  be  well  defined,  we  must  have  |r|  >0  as  well 
as  |r|  >0.  The  former  requirement  is  satisfied  by  assuming 
that  |r|  ytz  0  initially,  and  then  analyzing  the  engagement 
(for  finite  time)  only  until  |r|  reaches  a  value  ro  >  0  [9], 
[13],  The  latter  condition  is  ensured  by  the  assumption  that 
0  <  v  <  1,  since  |r|  >  1  —  v. 

D.  Feedback  law  for  motion  camouflage 

We  define  the  notation  q  to  represent  the  vector  q  rotated 
counter-clockwise  in  the  plane  by  an  angle  n/2  [9],  [13]. 
When  there  is  no  delay  associated  with  incorporating  sensory 
information,  we  define  our  feedback  law  as 


where  n  >  0  is  a  gain  parameter  [9],  [13].  However,  if  there 
is  a  delay  r  in  the  incorporation  of  sensory  information,  then 
we  substitute  up(t  —  r)  for  up  in  equation  (1). 

Observe  that  (7)  is  well  defined  since,  by  the  discussion 
in  the  previous  subsection,  |r|  /  0  during  the  duration  of 
our  analysis. 

E.  Deterministic  analysis 

The  key  results  for  the  deterministic  motion  camouflage 
feedback  system  are  presented  in  [9],  [13].  These  results, 
particularly  the  planar  result  in  [9],  are  the  inspiration  for 
the  calculations  below  in  Section  IV. 
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III.  Sensor  noise 

One  way  to  introduce  stochasticity  into  the  motion  cam¬ 
ouflage  system  is  through  sensor  noise  [9].  Figure  2,  which 
is  taken  from  [9],  illustrates  how  sensor  noise  can  be  incor¬ 
porated. 

In  figure  2(a),  the  dashed  dark  line  is  the  evader  trajectory 
and  the  solid  dark  line  is  the  corresponding  pursuer  trajectory 
under  control  law  (7),  but  with  noisy  measurements.  An 
M2- valued  independent  identically-distributed  (iid)  discrete¬ 
time  Gaussian  noise  process  with  zero  mean  and  covariance 
matrix  diag(cr2,  a2),  cr  =  ,15|r|,  is  added  to  the  true  relative 
position  r  at  each  measurement  instant.  Similarly,  an  Re¬ 
valued  iid  discrete-time  Gaussian  noise  process  with  zero 
mean  and  covariance  matrix  diag (a2,  a2),  a  =  .15 |r  | ,  is 
added  to  the  true  relative  velocity  r  at  each  measurement 
instant.  These  two  measurement  processes  are  then  used  by 
the  pursuer  to  compute  (7).  The  position  measurements  are 
superimposed  on  the  true  trajectory  of  the  evader,  and  it  can 
be  seen  that  the  absolute  measurement  error  decreases  as 
the  relative  distance  |r|  becomes  small.  In  this  simulation, 
the  gain  /i  =  1,  the  measurement  interval  is  approximately 
.5  time  units  (during  which  a  constant  steering  control  up 
is  applied),  the  pursuer  moves  at  unit  speed,  and  the  total 
simulation  time  is  approximately  1500  time  units. 

Figure  2(b)  shows  the  corresponding  cost  function  r(f) 
given  by  (6),  plotted  as  a  function  of  time.  The  difference 
between  T(t)  and  —1  measures  how  far  the  system  deviates 
from  a  state  of  motion  camouflage.  Part  of  this  deviation 
is  due  to  the  noisy  measurements,  and  part  is  due  to  the 
maneuvering  of  the  evader  (because  the  gain  /i  is  finite). 

IV.  Stochastically  steering  evader 


(a) 


Fig.  2.  Pursuit  with  noisy  sensory  measurements  available  to  the  pursuer: 
(a)  pursuer  and  evader  trajectories,  (b)  the  corresponding  distance  function 
r  as  a  function  of  time.  (Figure  from  [9].) 


A.  SDE  for  T 

Let  us  now  suppose  that  ue  is  not  a  deterministic  function 
of  time,  but  is  instead  driven  by  a  stochastic  process  (in 
a  way  we  will  make  precise  later).  Then  r  and  r  are  also 
stochastic  processes,  as  is  F  given  by  (6).  Analogous  to  the 
calculation  of  1  given  in  [9],  we  can  derive  the  following 
SDE  (Stochastic  Differential  Equation)  for  T  (see  Remark 
5): 


+  W\  t1  -  V^V  •  Xe))updt 

+  W\  Mw*1)  (l/  “  '  ^e))^Uedt, 

(8) 


which  is  supplemented  by  the  SDE  versions  of  (1)  and  (2), 
all  of  which  should  be  interpreted  as  stochastic  differential 
equations  of  the  Ito  type.  Here,  as  in  [9],  the  notation  q 
represents  the  vector  q  rotated  counter-clockwise  in  the  plane 


by  an  angle  tt/2.  Substituting  (7)  into  (8)  gives  (c.fi,  [9]) 


dF  =  - 


-  v{yip  •  xe)) 


j*f 

M. 


{v  -  (xp  •  xe))v2uedt. 

(9) 


Noting  that 


and  that  1  —  T2  >  0,  we  conclude  that 

dT<-(l-T2)  (1  -  z,(xp  •  xe))  -  j^j  dt 

Tj^plV1  -  r2)  (*z  -  (Xp  •  xe))zAre  dt.  (11) 

Futhermore,  as  in  the  deterministic  analysis  in  [9],  we  have 
the  following  inequalities: 

|xp  •  xe|  <  1,  and  1  —  v  <  |r |  <  1  +  v,  (12) 
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so  that 


dF  <  -(1  -r2) 


u2(l  +  u) 


1  -  V 

1  +  u 


i-r2 


l  +  U 


\dt. 


dt 


03) 


For  n  >  0,  we  can  define  constants  ?’o  >  0  and  Cq  >  0  such 
that 

'l  +  v\  (l  +  u 


A  = 


l-i/ 


r  o 


c0 


(14) 


and  thus 

A  > 

We  thus  have 


l  +  u\  ( l  +  u 
l-u 


+  C0  ,  V|r|  >  r0.  (15) 


dr  <  -(i-r2)c0^+  U^1_+V^  (Vi  -  r2)  h| dt,  (16) 


(1-*) 


for  all  | r |  >  tq. 


Because  the  initial  positions  |rp(0)|  and  |re(0)|  are  as¬ 
sumed  to  be  deterministic  (even  when  ue  is  stochastic),  it 
follows  that  |r (0) |  is  deterministic.  For  ro  <  |r(0)|,  and 
using 

|r(f)|  >  |r(0)|  -  (1  +  u)t,  (23) 


we  can  conclude  that  the  interval  [0,T),  where 


k(0)|  —  y0 

l  +  u 


>0, 


(24) 


is  an  interval  of  time  over  which  we  can  guarantee  that  |r|  > 
ro  (regardless  of  the  sample  path  of  ue). 

From  the  form  of  (22),  it  is  clear  that  by  choosing  c2 
sufficiently  large,  A'[F]  can  be  driven  to  an  arbitrary  negative 
value  at  time  T,  but  for  the  fact  that  (22)  is  only  valid  for 
E[  1  —  r2]  >  e.  Indeed,  for  any  ??  >  0  and 


C2  > 


1  +  E[Tq\ 
eT 


+  V, 


(25) 


B.  Bounds  for  E[T] 

The  next  step  is  to  take  expected  values  of  both  sides  of 
(16),  which  yields 


jE[T]<-c0E  [1-T2] 


'2(1 ±») 
-  U )2 


ue  |\/i-r2] , 
(17) 


provided  |r|  >  r0.  By  the  Cauchy-Schwartz  Inequality, 


by  a  contradiction  argument,  E[  1  —  F2 (ti)]  <  e  must  hold 
for  some  t\  €  [0,  T). 

C.  Statement  of  result 

Analogously  to  [9],  we  define  a  notion  of  (finite-time)  “ac¬ 
cessibility”  of  the  motion  camouflage  state  for  the  stochastic 
setting: 


E 


ue  x/l-T2 


<  VWfWE[i-r2}, 


from  which  it  follows  that 

jtE[T]  <  —cqE  [i  -  r2]  +  Ciy/E  [i-r2], 


(18) 

(19) 


provided  |r|  >  ro-  Here  we’ve  assumed  that  ue  has  a 
bounded  second  moment  (i.e.  E[u2]  <  u^laa,  for  some 
constant  umax  >  0)  an£l  we’ve  defined 


Cl 


u2{l  +  u) 
(1-u)2 


U max 


>  0. 


(20) 


We  can  now  show  that,  given  0  <  e  <<  1,  we  can 
choose  Co  (and  hence  /i)  sufficiently  large  so  as  to  ensure 
that  dE[T]/dt  <  0  for  E[  1  —  T2]  >  e  (provided  |r|  >  ro).  In 
particular,  choose  Co  >  C\/y/e.  Then  (19)  becomes 

l£[rls-£[1-r2]  (c“- 

<  —E  [l  —  r2]  c2 

<  -c2e,  (21) 


where  c2  =  Co  —  C\/yfe  >  0,  and  provided  jr|  >  ro.  Now, 
(21)  can  be  integrated  with  respect  to  time  to  give 


E[r}<-c2et  +  E[r0],  (22) 


Definition  1:  Given  the  system  (1)  -  (2),  interpreted  as  SDEs 
driven  by  random  processes  up  and  ue  having  (piecewise) 
continuous  sample  paths,  we  say  that  “motion  camouflage  is 
accessible  in  the  mean  in  finite  time”  if  for  any  e  >  0  there 
exists  a  time  t\  such  that  E[  1  —  F2(ti)]  <  e. 

Proposition  1:  Consider  the  system  (1)  -  (2),  with  control 
law  (7),  and  1  defined  by  (6),  with  the  following  hypotheses: 

(Al)  0  <  u  <  1  (and  v  is  constant), 

(A2)  ue  is  a  stochastic  process  with  piecewise  continuous 
sample  paths  and  bounded  first  and  second  moments 
(i.e.  3  constant  0  <  umax  <  oo  such  that  Vt  >  0, 
E[u2e]  <  u2max  and  | E[u  e]  |  ^3  llmaxf 
(A3)  ue  is  of  a  form  such  that  the  matrix  X  =  [xe  ye } 
evolves  on  SO( 2), 

(A4)  E[  1  -  T2]  >  0,  where  T0  =  r(0),  and 
(A5)  |r (0) |  >  0. 

Then  motion  camouflage  is  accessible  in  the  mean  in  finite 
time  using  high-gain  feedback  (i.e.,  by  choosing  y  >  0 
sufficiently  large.) 

Proof:  The  proof  is  along  the  lines  of  the  proof  of  Proposi¬ 
tion  3.3  in  [9]  for  the  deterministic  system. 

Without  loss  of  generality,  we  may  assume  that  E[  1  — 

r2]>e. 

Choose  ro  >  0  such  that  ?’o  <  |r(0)|.  Choose  c2  >  0 
sufficiently  large  so  as  to  satisfy 


as  long  as  E[  1  —  T2]  >  e,  where  To  =  r(0),  and  provided 
I r |  >  r0. 


C2  > 


l  +  u 


|r(0)|  -r0 


1  +  E[T0\ 


(26) 
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where  ?/  >  0,  and  choose  cq  as 

1  /  v2{\  +  v)  \ 

C°-C2  +  Ti  ( (i - v)2  Umax)  ‘  (27) 

Then  defining  /i  according  to  (14)  ensures  that  E[  1  — 
T2(ii)]  <  e  for  some  t\  £  [0,T),  where  T  is  given  by 
(24).  □ 

Remark  1:  Definition  1  above  does  not  distinguish  between 
motion  camouflage  with  decreasing  baseline  distance  (i.e., 
T  =  —  1)  and  motion  camouflage  with  increasing  baseline 
distance  (i.e.,  T  =  +1).  By  contrast,  the  definition  of  finite¬ 
time  accessibility  of  motion  camouflage  given  in  [9]  deals 
only  with  decreasing  baseline  distance. 

Remark  2:  Assumption  (A3)  equates  to  ensuring  that  the 
associated  vector  equation  evolves  on  a  circle.  This  is  dis¬ 
cussed  in  the  following  section. 

V.  Admissible  stochastic  controls 

In  considering  the  possible  families  of  stochastic  processes 
that  could  serve  as  controls  for  the  evader,  we  can  only 
select  such  controls  that  will  cause  the  “rotation  matrix” 
X  =  [xe  t/e]  to  evolve  on  SO( 2),  the  special  orthogonal 
group  in  two  dimensions.  For  a  stochastic  ue,  (2)  provides 
the  stochastic  differential  equation 

dXt  =  XtAuedt ,  (28) 

where  A  is  the  skew-symmetric  matrix  defined  by 

o')- 

Let  Xq  £  R2  and  define  xt  by  xj  =  Xq  Xt.  Then  we  have 


dxj  =  xj  Auedt 


dxt  =  AT  xtuedt. 


It  can  be  shown  (see,  e.g.,  [10])  that  Xt  evolves  on  SO( 2) 
if  and  only  if  (30)  evolves  on  a  circle. 

Proposition  2:  Let  the  stochastic  evader  control  ue  be 
defined  as  follows: 


Then  (32)  becomes 

dy  =  f(y)dt  +  g(y)dW.  (34) 

Letting  i/j(y)  =  xTx  and  using  Ito’s  rule  for  differentiating, 
we  have 


d(  xTx)  =  di/t(y) 


dij)  dxj)  1  /  d 2if> 

~dt  +  ^  f  +  2tr  \dydy 


d2^  t\  1 

dt 


J  L  «0)  J 
(\  2  o  1  r  °  o 
r  \[  0  o  J  [  0  P2{z) 
2x  i  r  o  i  1Tir 


=  2xT  ATxzdt 


where  the  last  step  follows  from  the  skew-symmetry  of  A1. 
Equation  (35)  implies  that  xTx  =  x^xo  for  all  times  t  >  0 
(i.e.,  (30)  evolves  on  a  circle),  and  therefore  (28)  evolves  on 
50(2).  □ 

Remark  3:  A  similar  result  can  be  proved  for  counter-driven 
stochastic  controls  of  the  form 

m 

dz  =  a(z,t)dt  +  /3i(z,t)dNj ,  z(0)  =  z0, 


where  Nt,  i  =  1,2 are  Poisson  counters  with  rates 
A i.  (Follow  the  previous  proof  and  use  Ito’s  rule  for  jump 
processes.)  □ 

We  note  the  following  specific  possibilities  for  stochastic 
controls: 


dz  =  a(z,t)dt  +  f3(z,t)dW,  2(0)  =  zq, 

ue  =  z,  (31) 

where  2  is  a  scalar  stochastic  process,  W(-)  is  standard 
Brownian  motion,  a:Rx[0,  oo)  — »  R  and  :  Rx  [0,  oo)  — > 
R  (and  suitable  technical  hypotheses  are  met).  Then  (28) 
evolves  on  50(2). 


Proof:  Grouping  (30)  and  (31)  and  dropping  the  time  sub¬ 
scripts  for  simplicity,  we  have 


(33) 


(a)  Brownian  motion.  Letting  a(z,t)  =  0  and  (3(z,t )  =  1  in 
(31)  results  in 

dz  =  dW,  2(0)  =  20,  ue  =  2,  (37) 

i.e.,  tte(-)  =  W(-).  In  this  case,  the  steering  control  would 
be  governed  by  sample  paths  of  a  Brownian  motion  process. 
However,  this  control  does  not  satisfy  assumption  (A2)  of 
Proposition  1  and  is  therefore  not  admissible. 

(b)  Brownian  motion  with  viscous  damping.  Let  a(z,  t )  = 
— dz  and  (3{z,  t)  =  a  for  constants  <5  >  0  and  er  £  R.  Then 
(31)  becomes 

dz  =  —Szdt  +  adW,  2(0)  =  20,  ue  =  2,  (38) 

which  is  better  known  as  the  Langevin  equation.  This  control 
satisfies  both  (A2)  and  (A3)  and  is  therefore  admissible. 
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(c)  “Run-and-tumble”  (bacterial  chemotaxis).  In  (36)  let 
a(z,  t)  =  0  and  define  the  Poisson  counter  rates  and 
coefficients  as  follows: 

Pi{z,t)  =  \z{z-  1), 

#2  (Z,t)  =  ~^Z(Z  +  1), 

@3 (Z,t)  =  (Z2  -  1), 

/34(z,t)  =  -(z2  -  1), 

Ai  =  A2  =  A  h, 

A3  =  A4  =  A  L.  (39) 

Then  (36)  becomes 

dz  =  7^z(z  —  l)dNi  —  7jz(z  +  l)dN2 

+(z2  -  1  )dN3  -  (z2  -  l)dN4,  2(0)  =  Zq, 

Ue  =  Z,  (40) 

and  ue  is  a  continuous  time,  finite  state  (CTFS)  process 
taking  values  in  the  set  {—1,0,1}.  Hence  ue  satisfies  (A2) 
and  (A3)  and  is  admissible  as  a  stochastic  control  for  the 
evader.  We  can  approximate  bacterial  chemotaxis,  the  “run- 
and-tumble”  control  used  by  certain  types  of  bacteria  to  move 
towards  food  sources,  by  choosing  A h  »  A l-  Under  this 
open-loop  control,  the  evader  will  move  primarily  in  straight 
paths  (ue  =  0),  making  occasional  random  short-duration 
turns  whenever  Poisson  counter  N3  or  N4  fires.  This  could 
also  be  implemented  as  a  closed-loop  control  by  feeding  state 
information  (such  as  range  to  the  pursuer)  back  to  the  counter 
rates  A h  and  A/,. 

Remark  4:  Note  that  the  control  uedt  =  dw  (i.e.,  ue  ~ 
“white  noise”)  is  not  a  permissible  control  for  the  evader, 
since  a  calculation  similar  to  (35)  yields 

d{  xTx)  =  xTAATxdt  =  v2xT  I  xdt,  (41) 

which  is  not  necessarily  zero,  and  therefore  X  =  \xe  ye] 
will  not  evolve  on  SO( 2).  □ 

Remark  5:  Under  assumptions  (A2)  and  (A3)  referred  to 
above  (we  are  specifically  interested  in  ue  processes  such 
as  (38)  and  (40)),  it  follows  that  for  each  path  of  ue,  the 
random  differential  equations  (1),(2)  with  control  (7),  have 
well-defined  local  pathwise  solutions  away  from  collisional 
states  r;,  =  re.  Applying  Ito’s  rule  to  the  ensemble  process 
(1)42), (7)  gives  us  (8). 

VI.  Simulation  Results 

The  following  simulation  results  demonstrate  the  effective¬ 
ness  of  the  pursuit  law  (7)  against  an  evader  using  a  “run- 
and-tumble”  steering  control  as  described  in  the  previous 
section,  confirming  the  analytical  results  presented  in  Section 
IV.  Each  simulation  is  based  on  the  same  parameters  but 
differs  by  the  ratio  of  the  Poisson  counter  rates  Al  and  A//. 
(Note  also  that  each  simulation  was  run  for  approximately 
250  time  units  in  steps  of  .1  time  units,  and  the  ratio  of 


Fig.  3.  (a)  Evader  trajectory  with  counter-driven  “run-and-tumble”  steering 
control  (dashed  dark  line),  and  the  corresponding  pursuer  trajectory  (solid 
dark  line)  evolving  according  to  (1)  with  control  given  by  (7).  In  this  case, 
\h  =  40 Af,.  (b)  The  corresponding  cost  function  T(t)  given  by  (6),  plotted 
as  a  function  of  time.  The  lighter  dashed  lines  correspond  to  a  small  value 
of  i±  while  the  darker  solid  lines  correspond  to  a  value  of  fi  which  is  three 
times  larger,  (c)  The  transient  behavior  of  T(f)  as  represented  by  an  initial 
time  interval  l/25th  the  duration  of  that  displayed  in  (b).  The  darker  line 
corresponds  to  the  higher  value  of  /i. 
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Fig.  4.  (a)  Pursuer  and  evader  trajectories  for  a  “run-and-tumble”  evader 

steering  control  with  A //  =  20 A  /_.  (b)  The  corresponding  cost  function 
r(f)  given  by  (6).  plotted  as  a  function  of  time.  (Note  that  the  transient 
behavior  of  T(t)  in  this  simulation  was  identical  to  that  from  figure  3  and 
was  therefore  omitted.) 


evader’s  speed  to  pursuer’s  speed  was  fixed  at  v  =  .9.) 
Figure  3(a)  shows  the  pursuer  and  evader  trajectories  for 
a  simulation  in  which  the  ratio  between  the  counter  rates 
is  very  large  (A//  =  10A/J  and  therefore  the  evader  makes 
fewer  maneuvers.  (The  lighter  lines  connecting  the  pursuer 
and  evader  at  regular  time  intervals  indicate  the  evolution 
of  the  baseline  vector  r.  If  the  system  (1),(2)  is  in  a  state 
of  motion  camouflage,  these  lines  will  be  parallel.)  Figures 
3(b)  and  3(c)  show  the  complete  and  transient  behavior, 
respectively,  of  the  cost  function  F(t)  given  by  (6).  (Each 
graph  shows  the  results  for  both  a  smaller  pursuit  feedback 
gain  /i  as  well  as  the  results  for  a  gain  three  times  larger.) 
Note  that  the  cost  function  is  driven  to  the  desired  value 
of  -1  (indicating  attainment  of  motion  camouflage)  with 
intermittent  spikes  corresponding  to  momentary  deviations 
from  the  motion  camouflage  state.  These  spikes  correspond 
to  time  instances  when  the  evader  executes  an  abrupt  turn. 

Figures  4  and  5  show  results  for  increasingly  larger  values 
of  A^  with  a  fixed  A#  (i.e.  higher  probability  of  evader 
maneuvering.)  As  demonstrated  in  figure  5(a),  increased 
evader  maneuvering  induces  more  frequent  steering  require¬ 
ments  for  the  pursuer,  indicating  that,  while  such  an  evasive 
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Fig.  5.  (a)  Pursuer  and  evader  trajectories  for  a  “run-and-tumble”  evader 

steering  control  with  A#  =  6.67 A^.  (b)  The  corresponding  cost  function 
T (t)  given  by  (6).  (c)  The  initial  behavior  of  T(t)  over  1/25 th  of  the 
simulation. 
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control  may  not  prevent  capture,  it  may  introduce  a  high 
steering/attention  cost  on  the  pursuer.  Note  from  figure  5(b) 
that  the  highly  erratic  evader  steering  control  produced  by 
large  values  of  A^  (i.e.  smaller  ratios  of  A h  to  A/,)  results 
in  frequent  deviations  from  motion  camouflage.  Figure  5(c) 
displays  the  initial  transient  behavior  of  r(t).  In  the  case  of 
the  larger  value  of  /i,  the  initial  behavior  of  r(i)  is  similar 
to  that  of  figure  3(c)  since  the  pursuer  is  able  to  maneuver 
into  a  motion  camouflage  state  prior  to  the  evader’s  first 
course  change.  For  the  smaller  value  of  //,  the  first  evader 
maneuver  occurs  while  r(i)  is  still  much  larger  than  -1, 
thereby  delaying  convergence  to  the  motion  camouflage  state. 

VII.  Directions  for  future  work 
In  both  the  deterministic  analysis  of  motion  camouflage 
in  [9],  [13]  and  in  the  stochastic  analysis  presented  here,  the 
speeds  of  both  the  pursuer  and  evader  have  been  constrained 
to  be  deterministic  (though  possibly  varying  in  [13]),  with 
bounds  to  ensure  that  the  evader’s  speed  is  sttictly  less  than 
that  of  the  pursuer.  Our  investigation  of  the  effects  of  stochas¬ 
tic  evader  behavior  could  therefore  be  extended  to  permit 
stochastic  variation  of  speed,  prompting  several  interesting 
questions.  While  the  current  work  has  demonstrated  that 
a  pursuer  can  always  attain  a  state  of  motion  camouflage 
if  the  evader’s  speed  is  deterministic  and  slower  than  the 
pursuer’s,  it  could  also  be  asked  whether  the  result  will  still 
hold  if  the  evader’s  speed  is  slower  only  on  average  or  if  the 
evader’s  speed  is  governed  by  a  stochastic  process  with  a 
particular  mean  function.  Additionally,  in  our  discussion  of 
motion  camouflage  in  both  the  deterministic  and  stochastic 
settings,  we  have  only  permitted  the  evader  to  use  an 
open-loop  strategy  rather  than  a  feedback  strategy  (whereas 
the  pursuer  uses  a  feedback  strategy).  In  the  language  of 
biologists,  such  engagements  would  more  properly  be  called 
“pursuer-pursuee”  engagements,  rather  than  pursuer-evader 
engagements.  The  problem  could  be  reformulated  in  the 
context  of  a  differential  game  (as  described  in  [6],  [7]) 
in  which  the  pursuer  would  adopt  a  strategy  to  intercept 
the  evader  (or  maximize  stealth),  and  the  evader  would 
adopt  a  strategy  to  elude  the  pursuer  (or  maximize  pursuer 
“visibility”).  These  types  of  feedback  strategies  could  be 
implemented  in  the  stochastic  setting  by  using  feedback  to 
regulate  parameters  of  the  stochastic  process  (e.g.  varying  the 
rates  of  the  evader’s  Poisson  counters  as  a  function  of  pursuer 
range.)  In  the  context  of  this  type  of  differential  game,  a 
family  of  varying  pay-off  functions  and  control  constraints 
could  be  used  to  explore  possible  connections  to  behaviors 
observed  in  biology.  The  run-and-tumble  motion  of  bacteria 
and  the  skittish  darting  of  fish  may  be  consistent  with  such 
a  formulation. 
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